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METHOD AND CONSTRUCT FOR SCREENING FOR INHIBITORS OF 
TRANSCRIPTIONAL ACTIVATION 

Field of the invention 

5 This invention relates to methods for screening and identifying inhibitors of 

promoters or inhibitors of transcriptional activation by use of a collision construct 
The collision construct provides for increased expression of reporter gene signal in the 
presence of an appropriate inhibitor. This invention, thus, relates to the collision 
construct as well as methods for making and producing the collision construct, and to 
10 vectors, host cells and kits containing the collision construct. 

Background of the Invention 
Past studies of inhibitors of gene function include studies of the inhibition of 
transcription of the gene, as described in Hsu et at. Science 254: 1799-1802 (1991) and 

15 Hsu et al. Proc. Nat'l Acad, ScL USA 90: 6395-6399 (1993). Structural genes have a 
transcription regulatory region that contains one or more sequences, referred to herein 
as response elements, mat are capable of binding to certain proteins, referred to herein 
as binding proteins, that activate or repress transcription or facilitate elongation of a 
mRNA transcript. These binding proteins include transcription factors, such as those 

20 described in Faisst & Meyer (1992), Nucleic Acids Res. 20: 3-26; THE 

ENCYCLOPEDIA OF MOLECULAR BIOLOGY, J. Kendrew. ed. (Blackwell 
Science, Oxford 1994); and those that are identified in specialized data base, as 
described in Ghosh (1993), Nucleic Acids Res. 21: 3117-3118. 

A transcription factor, such as an activator, typically contains a domain chat 

25 recognizes a specific DNA sequence, the response element, and binds it. Transcription 
activators may also contain another domain that interacts with other transcription 
factors to initiate transcription or to allow elongation of the RNA transcript. Thus, a 
molecule that inhibits binding of an activator to a response dement, either by 
competitively binding to the DNA-binding domain of the activator or to the response 

30 element, or a molecule that blocks the transcription factor-interacting domain of the 
activator, would be expected to inhibit transcriptional activation. 
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Repressors, also mostly proteins, function in a similar manner, except that 
instead of activating transcription, the repressor binds to the response element, for 
example, an operator, and blocks transcription. Molecules that axe capable of 
competitively binding to repressors activate transcription by removing the repressor 
5 from the response element. Thus, an inhibitor to the molecule that competitively binds 
to the repressor would be expected also to inhibit transcriptional activation. 

There exist other modes of molecular action which also function to inhibit 
transcription and which operate by methods distinct from classic repression or 
activation of gene transcription described above. Such methods include, for example, 
10 catalytic events directed against the mRNA of a transcription factor, it would be 
desirable to employ these methods for inhibiting transcription of a target promoter or 
transcription of a transcription factor and to devise a method to screen for such 
inhibitors. 

Conventionally, when researchers look for an inhibitor of a biological function, 
15 they look for a decrease in biological function or a decrease in a reporter signal that 
reflects inhibition of that function. Very often, a decrease in signal is difficult to 
interpret because the decrease may be the result of factors other than the presence of 
the supposed inhibitor being tested. For example, the decrease in signal may be caused 
by the presence of extraneous matter including toxic chemicals in the media, 
20 inappropriate incubation temperature, inappropriate incubation time, poor condition of 
the cells used in the test, etc. In order to resolve the matter, a number of time- 
consuming experiments have to be run with a number of controls. 

It would be desirable, therefore, if the presence of an inhibitor can be reflected 
by an increase in reporter gene signal instead of a decrease. 

25 

Summary of the Invention 
It is, therefore, an object of the present invention, to provide a screening test 
for inhibitors that is capable of generating an increase in reporter signal in the presence 
of an inhibitor. 

30 It is also an object of the present invention to provide for materials mat can be 

used in such a screening test. 
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In accordance thereto, there is provided herein a construct, termed a collision 
construct, that contains a nucleic acid molecule, comprising a first regulatory sequence 
that comprises a first promoter, a reporter gene that is under transcriptional control of 
the first promoter, where the reporter gene is capable of providing a detectable signal 
5 upon transcription and translation thereof, and a second regulatory sequence that 
comprises a second promoter, where the direction of transcription under the first 
promoter is opposite to the direction of transcription under the second promoter, where 
regulation of transcription under the second promoter alters the reporter gene signal, 
and where the first promoter is different from the second promoter. 

10 In accordance to a further object of the present im'ention, there is provided 

herein the collision construct as above, where the second promoter or the second 
regulatory sequence comprises a first response element that is capable of binding to a 
first binding protein to form a first binding pair, and the formation of the first binding 
pair regulates the activity of the second promoter. 

15 In accordance with another object of the present invention, there is provided 

herein the collision construct as above, where the last nucleotide of the stop codon of 
the reporter gene is separated from the 3* terminus of the second promoter by a 
distance of about less than about 2050 nucleotides. 

In accordance to still another object of the present invention, there is provided 

20 herein the collision construct as above, where one or both of the first promoter and 
second promoter are derived from a promoter or promoter/enhancer region of a gene 
selected from the group consisting of: a viral gene, a bacteriophage gene, a prokaryoric 
gene, and an eukaryodc gene. The eukaryotic gene can be a yeast or other fungal 
gene, an avian gene, an insect gene or a mammalian gene. Alternatively, the promoter 

25 or promoter/enhancer may be synthetically made, or partly derived and partly 
synthesized. 

In accordance to yet another object of the present invention, there is provided 
herein the collision construct as above, where one or both of the first response dement 
and the second response element, the latter optionally present in the first regulatory 
30 region, are derived from the regulatory sequence of a gene selected from the group 
consisting of: a viral gene, a bacteriophage gene, a prokaryoric gene, and an 
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eukaryotic gene. The eukaryotic gene can be, for example, a yeast or other fungal 
gene, an insect gene, an avian gene, or a mammalian gene. 

In accordance to a further object of the present invention, there is provided 
herein a method of using the collision construct as above for screening or identifying a 
5 candidate inhibitor for its ability to inhibit transcription under a target promoter, the 
method comprising the steps of providing a cell that contains the collision construct, 
where the second promoter in the construct is the target promoter and the cell is 
capable of expressing the collision construct to produce a reporter gene signal, 
determining reporter gene signal in the absence and presence of the candidate inhibitor, 
10 respectively, and comparing reporter gene signals obtained. An appropriate inhibitor 
is one that is capable of generating an increased reporter signal in the presence of an 
inhibitor. 

In accordance to another object of the present invention, there is provided 
herein a method as above, where the second promoter or the second regulatory region 

15 in the collision construct comprises a response element that is capable of binding to a 
binding protein. The binding protein can be provided by coexpression in a cell of the 
collision construct and a vector that comprises a coding sequence for binding protein. 
Alternatively, the binding protein can be provided by a cell that produces it 
consti cu lively and the collision construct is then introduced into the cell. Also, the 

20 binding protein can be added directly to the cell that contains the collision construct. 

In accordance to another object of the present invention, there is provided 
herein a method of making the collision construct by providing and linking together a 
first regulatory sequence that comprises a first promoter, a reporter gene that is capable 
of providing a detectable signal upon transcription and translation, a second regulatory 

25 sequence that comprises a second promoter, where the reporter gene is placed under 
regulatory control of the first promoter, the direction of transcription under the first 
promoter is opposite the direction of transcription of the second promoter, and the first 
promoter is different from the second promoter. 

In accordance to a further object of the present invention, there is provided 

30 herein a method of production of the collision construct by culoiring a host cell that 
comprises the collision construct, for example, a prokaryoric or eukaryotic host cell, 
for example, a bacterial or yeast cell. 
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In accordance with yet another object of the present invention, there is provided 
herein a kit that comprises the collision construct as above, or a vector or host cell 
containing the collision construct, with instructions for use thereof in accordance with 
the method described above. 

Further objects, features, and advantages of the present invention will become 
apparent from the following detailed description. It should be understood, however, 
that the detailed description, while indicating preferred embodiments of the invention, 
is given by way of illustration only, since various changes and modifications within the 
spirit and scope of the invention will become apparent to those skilled in the art from 
this detailed description. 

Brief Description of the Drawings 

FIG. I is a schematic representation of a collision construct, transformed in 
HeLa cells, containing the CMV promoter and the HIV-1 promoter running in opposite 
directions and another construct for control in which the HIV-1 promoter was absent. 
The graph depicts Tat-dependent inhibition of CMV promoter activity over a range of 
Tat levels, in micrograms of Tat expression vector, from about 0 to 2. The symbol (♦) 
represents die alkaline phosphatase gene expression in the collision construct. The 
symbol (0) represents alkaline phosphatase gene expression in the control construct 
showing nonspecific reduction of CMV promoter activity by Tat protein. Other 
abbreviations include AP, representing alkaline phosphatase; and CMV, representing 
the cytomegalovirus promoter/enhancer. 

FIG. 2 is a schematic representation of five different collision constructs, 
designated as #1152, #1161, #1225, #1162, and #1163, and the reporter gene signals 
in percent alkaline phosphatase activity, generated by expression of 1 ug DNA each, 
respectively, and determined in the presence of 0.5 ug of Tat protein expression 
plasmid (" +Tat*) or 0.5 ug inactive Tat protein expression plasmid ("-Tat") in HeLa 
cells. FIG. 2 shows that specific reduction of alkaline phosphatase expression in the 
presence of HIV-1 Tat protein is dependent on a functional TAR sequence in the LTR. 
When the collision construct #1 152 was used, in the presence of inactive Tat plasmid, 
reporter gene signal was high. This signal was suppressed in the presence of active Tat 
plasmid by about 6056. When a portion of the TAR sequence was deleted (construct 
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#1 161), or when further portions of the promoter were deleted, there was no 
significant difference in reporter gene signal in the presence or absence of TaL Thus, 
suppression of alkaline phosphatase expression in the collision construct is Tat protein 
and TAR sequence dependent. 
5 FIG. 3 is a schematic representation of five different reporter constructs, 

designated as #1085, #1166, #1213, #1167, and #1168, and the reporter gene signals, 
indicated as percent alkaline phosphatase activity, generated by expression of 1 ug of 
each, respectively, in the presence of 0.5 ug Tat protein expression plasmid C+TaT) 
or 0.5 ug inactive Tat protein expression plasmid ("-Tat") in HeLa cells. The alkaline 

10 phosphatase activity measured with construct #1085 in the presence of active Tat 
protein expression plasmid was set to 100%. FIG. 3 illustrates that HI V-l promoter 
activation by Tat protein requires the presence of the TAR sequence and that TAR may 
possibly have a silencer function because when a portion of TAR was deleted, reporter 
gene signal in the absence of Tat increased as compared to that when the complete 

15 TAR sequence was present. Addition of Tat to cells containing this construct, #1 166, 
which does not allow the formation of a TAR stem loop structure, enhanced reporter 
gene signal by about 2.5-fold. Additional deletion of the promoter region to include 
the TATA box and Spl binding sites resulted in complete loss of reporter gene signal, 
in the absence or presence of Tat. 

20 FIG. 4 is a schematic representation of different collision constructs containing 

spacer regions of about 21, 94, 153, 406, 556 and 2047 nucleotides, positioned 
between the 3' end of the AP coding sequence, as defined by the stop codon TAA, or 
TGA in construct with a spacer of 21 nucleotides, and the end of the TAR sequence at 
nucleotide +59 in the HIV- 1 promoter, with + 1 nucleotide as the start of 

25 transcription. The sequence of this junction is shown in FIG. 5. 

FIG. 5 shows an argument map of the DN A sequence of the 3* end of the AP 
gene and the 5' end of the mutant HIV-1 promoter in the collision construct #1 152. 
The stop codon of the AP coding region is indicated. Start of transcription in the HIV- 
1 promoter at position 263 is indicated with + 1 . Nucleotide substitution st position 

30 268 (substitution of T to Q, position 271 (substitution of C to A), and position 344 
(substitution of A to C) are shown. In addition, an insertion of a T nucleotide at 
position 301 is also shown. 
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FIG. 6 indicates that TAR decoys act as specific inhibitors of HIV-1 
transcription. FIG 6a shows inhibition of HIV-1 transcription in the presence of TAR 
decoys. At the top of FIG 6a is a schematic diagram of an HIV-AP reporter gene 
construct and a plasmid expressing multimerized (8 copies) of transactivation response 
5 sequence, also called TAR decoys. The arrows indicate the direction of transcription 
and translation. The black boxes represent the TAR sequence. At the bottom of FIG. 
6a, a schematic indicates that HcLa cells were transfected with various combinations of 
HIV-AP (lug) reporter and TAR expression plasmids (0.5ug) in the presence or 
absence of 0. 5ug of Tat expression vector. AP activity was determined as described 

10 previously and is expressed as fold activation relative to the level obtained with the 
HIV-AP plasmid in the absence of Tat (represented in lane 1). FIG 6b shows 
increased reporter gene expression in the collision construct by inhibition of HIV- 1 
promoter activity. At the top of FIG 6b is a schematic representation of th collision 
construct and a plasmid expressing multimerized copies of the TAR sequence. At the 

15 bottom of FIG. 6b, HeLa cells were transfected with various combinations of the 

collision construct (lug) and increasing amounts of a TAR expression plasmid (lanes 3 
to 5, lug and 2ug respectively) in the absence or presence of 0.5 ug Tat expression 
plasmid. The total DNA concentration in each experiment was kept constant by adding 
a Tat expression vector containing a premature stop codon and a pBJ plasmid 
20 expressing an unrelated (leptin) gene. Each column represents the mean of at least 
three independent experiments. Error bars represent standard error from multiple 
transfections. 



Detailed Description of the Preferred Embodiments 
25 The invention described herein draws on previously published work and, at 

times, on pending patent applications. By way of example, such work consists of 
scientific papers, abstracts, or issued patents, and published patent applications. All 
published work cited herein are hereby incorporated by reference. 

The inventors herein have discovered that a collision construct can be made that 
30 can be used for screening inhibitors of promoter or transcriptional activity. By use of 
this collision construct, the presence of a desired inhibitor is indicated by an 
enhancement in reporter gene signal. 
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For a better understanding of the present invention, the terms used herein have 
the following definition: 

A "nucleic acid molecule" or "nucleic acid sequence" refers Co either a DNA 
sequence, a RNA sequence, or complementary strands thereof that comprise a 
5 nucleotide sequence. 

A "regulatory sequence" refers to a nucleic acid sequence encoding one or more 
elements that are capable of affecting or effecting expression of a gene sequence, 
including transcription or translation thereof, when the gene sequence is placed in such 
a position as to subject it to the control thereof. Such a regulatory sequence can be, 

10 for example, a minimal promoter sequence, a complete promoter sequence, an 

enhancer sequence, an upstream activation sequence ("UAS"), an operator sequence, a 
downstream termination sequence, a polyadenylauon sequence, an optimal 5* leader 
sequence to optimize initiation of translation, and a Shine-Dalgarno sequence. 
Alternatively, the regulatory sequence can contain a combination enhancer/promoter 

15 element. The regulatory sequence that is appropriate for expression of the present 
construct differs depending upon the host system in which the construct is to be 
expressed. Selection of the appropriate regulatory sequences for use herein is within 
the capability of one skilled in the art. For example, in prokaryotes, such a regulatory 
sequence can include one or more of a promoter sequence, a ribosomal binding site, 

20 and a transcription termination sequence. In eukaryotes, for example, such a sequence 
can include one or more of a promoter sequence and/or a transcription tenrunation 
sequence. If any necessary component of a regulatory sequence that is needed for 
expression is lacking in the collision construct, such a component can be supplied by a 
vector into which the collision construct can be inserted for transformation or 

25 remtroducnon into a host cell. Regulatory sequences suitable for use herein rruy be 
derived from any source including a prokaryotic source, an eukaryotic source, a virus, 
a viral vector, a bacteriophage or a linear or circular plasmid. An example of a 
regulatory sequence is the human uiuusnodeficiency virus ("HIV 1") promoter that is 
located in the U3 and R region of the HIV- 1 long terminal repeat ("LTR"). 

30 Alternatively, the regulatory sequence herein can be a synthetic sequence, for example, 
one made by combining the UAS of one gene with the remainder of a requisite 
promoter from another gene, such as the GADP/ADH2 hybrid promoter. 
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A "minimal promoter" is a naturally occurring promoter that has been 
weakened so that it is not 100% active. For example, a promoter in which all but a 
TATA box has been deleted, such as the minimal fos promoter, as described in 
Berkowitz et al. (1989) Mol. Cell. Biol. 5:4272-42*1. 
5 A "reporter gene- refers to a nucleic acid molecule that encodes a polypeptide 

that is capable of providing a detectable signal either on its own upon transcription or 
translation or by reaction with another one or more reagents. Reporter genes suitable 
for use herein are conventional in the art, selection of which is within the capability of 
one skilled in the an. Examples of such reporter genes include that encoding the 

10 enzyme chloramphenicol acetyltransferase ("CAT"), the luc gene from the firefly that 
encodes luciferase, the bacterial lacZ gene from Escherichia coli mat encodes 0- 
galactosidase, alkaline phosphatase ("AP"), human growth hormone ("hGH"), the 
bacterial p -glucuronidase ("GUS"), and green fluorescent protein ("GFP"). as 
described in Ausubel et al.. CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 

15 (1994), (Greene Publishing Associates and John Wiley & Sons, New York, N.Y.). 

A "response element" refers to a region of a nucleic acid molecule, usually, 
from a regulatory region of a gene, that is capable of specifically binding to a binding 
protein, such as an activator molecule, for activation of transcription or for allowing 
the elongation of a RNA transcript, or a repressor molecule, for inhibition of 

20 transcription. Some response elements are known in the art. Selection of a response 
element that is suitable for use herein is within the capability of one skilled in the art. 

A "binding protein" herein refers to a protein that is capable of specifically 
binding to a response element for regulation of transcription. Some binding proteins 
are known. Selection of a binding protein suitable tor use herein is also within the 

25 capability of one skilled in the art A number of DNA binding proteins as well as 
response elements of the transcription regulatory regions are described in Wingender 
(1988), Nucleic Acids Res. 16: 1879-1902; Molecular Cell Biology, J. Darnell, H. 
Lodish & D. Baltimore, (Scientific American Books, New York 1990); and Dhawaie 
& Lane (1993), Nucleic Adds Res. 2J:5537-5546\ One example is the Tat/TAR 

30 combination found in viruses such as human immuaodefaaency virus-1 ("HIV-1 "), 
. human immunodeficiency virus-2 ("HTV-2"), and simian immunodeficiency virus 
("SIV"). In these viruses, mutr-activator, "Tat", is the binding protein referred to 
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herein and rra/tf-acuvating response element, TAR", is the response element referred 
to herein, as described in Jones & Peter! in (1994), Ann. Rev. Biochem. 63:717-743; 
and Antoni et at. (1994), Adv. Virus Res. 4J.53-145. Examples of response elements 
and binding proteins besides TAR and Tat include Rev response dement ("RRE"), a 
5 NF-icB binding site, a Spl binding site, and GaM and LexA binding sites. Examples 
of binding proteins include Tat, Rev, NF-kB, Spl , Gal 4. and LexA. 

The term "binding pair" refers to a pair of molecules, including a DNA/DNA 
pair, DNA/RNA pair, protein/DNA pair, protein/RN A pair, and a protein/protein pair 
in which the components of the pair bind specifically to each other with a higher 
10 affinity than to a random molecule, such that upon binding, the pair triggers a 

biological response, such as activation of transcription or where the binding protein is a 
repressor, suppresses a biological response, that is, transcription. 

The term "specific binding" in reference to interaction between two molecules 
indicates a higher affinity binding and a lower dissociation constant than non-specific 
15 binding, thus, distinguishing specific binding from background binding. 

The term "regulates," in the context of transcription, denotes both positive and 
negative regulation. Positive regulation is exemplified by activation. Negative 
regulation is exemplified by repression. 

Although the methodology described below is believed to contain sufficient 
20 details to enable one skilled in the art to practice the present invention, other constructs 
not specifically exemplified, such as piasmids, can be constructed and purified using 
standard recombinant DN A techniques as described in, for example, Sambrook et aL 
(1989), MOLECULAR CLONING: A LABORATORY MANUAL, 2nd cd. (Cold 
Spring Harbor Press, Cold Spring Harbor, New York); and under current regulations 
25 described in United States Department of HEW, NATIONAL INSTITUTE OF 
HEALTH (NIH) GUIDELINES FOR RECOMBINANT DNA RESEARCH. 

In one embodiment of the present invention, therefore, the collision construct 
comprises a reporter gene coding sequence that is linked at its 5 9 end to a first 
regulatory sequence that comprises a first promoter such that the reporter gene is 
30 placed under transcriptional regulatory control of the first promoter. The reporter gene 
is linked at its 3* end to a second regulatory sequence that comprises a second 
promoter in such a fashion that transcriptional activity of the second promoter 
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interferes with the transcriptional activity of the first promoter. This can be done, for 
example, by placing the first regulatory sequence and the second regulatory sequence 
in such a manner that transcription under the second promoter proceeds in a direction 
opposite to direction of transcription under the first promoter. 
5 The collision construct herein can be used, for example, for screening or 

identification of an inhibitor of transcriptional activity. For this use, a collision 
construct as described above is made with the promoter to be inhibited (hereafter "the 
target promoter") as the second promoter. The collision construct so made is inserted 
into a vector for expression, with or without the use of linker elements. The 

10 recombinant vector is then introduced into a compatible host cell that can effect the 
expression of the reporter gene. There are known vectors and host ceils that can be 
used for these purposes, as described in greater detail below. 

The regulatory sequences suitable for use herein can be any regulatory sequence 
that is compatible for use with the promoters for expression in a desired host cell. For 

15 example, if the collision construct contains a mammalian gene promoter, a regulatory 
sequence derived from mammalian systems would be desirable. The regulatory 
sequence can be a sequence naturally associated with the promoters selected for use 
herein, or can be a synthetic sequence, or partly synthetic or partly derived. 

The promoters suitable for use herein can be any promoter, including those that 

20 are constitutively active or those that are inducible or regulatable. The promoters can 
be naturally derived or synthetically made. They can be derived from any genes, viral, 
prokaryotic or eukaryotic. The eukaryotie genes can be yeast or other fungal, insect, 
mammalian or avian genes. In a preferred embodiment, the target promoter is derived 
from a virus or a tumor cell- Examples of suitable promoters are described below in 

25 the portion relating to expression systems. 

A suitable promoter for use as the first promoter in the present collision 
construct is one that possesses a transcriptional activity that is about the same strength 
as mat of the second promoter. If the first promoter is comparatively much stronger 
* than the second promoter, inhibition of reporter gene signal by the presence of the 

30 second or target promoter may be low, and an enhanced reporter gene signal in the 
presence of an inhibitor may be difficult to detect 
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Thus, if an available promoter to be used in the collision construct as a first 
promoter is too strong, it may be desirable to weaken the promoter by deleting parts 
thereof to generate a minimal promoter. This can be done by a number of methods 
including, for example, restriction enzyme digestion. An example of a weakened 
5 promoter is one in which all but the TATA box is deleted. A promoter is considered 
too strong herein if it drives the expression of the reporter gene to about the same level 
regardless of the presence or absence of the second promoter that drives transcription 
in the opposite direction. 

When the target promoter to be inhibited is constitutively active, a reporter 

10 gene signal expressed by a transformed host cell containing the collision construct can 
be first established in the absence of any inhibitors. A candidate inhibitor can then be 
introduced or added to the cells and the reporter gene expression can be monitored. 
Alternatively, the transformed cells containing the collision construct can be placed in 
a panel of microliter wells and a panel of candidate inhibitors can be added to the cells, 

IS one inhibitor to each well. A suitable inhibitor is one that generates an enhanced 
reporter signal in its presence as compared with the signal produced in its absence. 

The target promoter to be used herein includes promoters that are subject to 
regulation, such as activation, by the binding of a binding protein to a response 
element in the proximity of the target promoter. The response element can be 

20 naturally present in the target promoter or can be artificially linked to the target 

promoter. Thus, the present collision construct can be used to identify an inhibitor that 
can inhibit activation of the target promoter by inhibiting binding between the binding 
protein and the response element This can be achieved by identifying an inhibitor that 
competitively binds either to the binding protein or to the response element. When the 

25 activity of the second promoter is inhibited, the reporter gene activity would return to a 
level similar to that in the absence of activation by the binding protein. 

The binding protein can be introduced into the cells by addition thereof to the 
medium containing the cells and gently scraping the cells from the culture dish. 
Alternatively, the binding protein can be provided in the form of a vector containing 

30 the coding sequence of the binding protein and regulatory sequences that would allow 
expression thereof. The vector can be introduced into the host cell either at, before, or 
after introduction of the collision construct into the cell. In another embodiment of the 
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present invention, stable cell lines containing a collision construct or the coding 
sequence of the binding protein can be first established, and the other sequence 
introduced later. Further alternatively, a stable cell line containing both the collision 
construct and the binding protein can be made and thereafter used for screening 
inhibitors. 

The collision construct or vector containing the collision construct can be 
introduced into host cells by conventional techniques including dectroporation, 
calcium phosphate treatment, and lipofec taurine trans fection. The target promoter can 
be a known promoter with a known nucleotide sequence that can be synthetically made 
or derived from a natural source such as a viral gene, a tumor cell gene or a fungal 
gene, for example. Typically, such promoters are excised from the natural source and 
inserted into the collision construct by use of restriction enzymes and/or linkers. 

Alternatively, the sequence of the promoter may not be precisely known, but 
the general location of the promoter is known, for example, the promoter can be 
known to reside in a particular restriction fragment. In this instance, the restricted 
fragment can be used as the second regulatory sequence of the present collision 
construct. 

In another embodiment of the present invention, it may be desirable to turn off 
the transcription of certain genes that are yet unidentified, for example, one responsible 
for production of a cancerous cell, even though the gene or genes responsible for this 
condition have not been identified. For this purpose, mRNA can be isolated from the 
tumor cell and compared to that obtained from normal cell by a common procedure 
known as subtractive hybridization. By substractive hybridization it can be determined 
which mRNA is present in the tumor cell but absent in normal cell. A cDNA molecule 
can be constructed based on the mRNA so obtained, and a fragment of the genomic 
DNA containing promoter activity can be isolated and used as a target promoter in the 
present collision. 

The collision construct herein can be inserted into a suitable vector for 
introduction into a host cell for expression and use thereof. A person skilled in the an 
would be able to select such a vector and host cell for such purposes. Moreover, 
examples of suitable vectors and host cells are described in greater detail below. 
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The reporter gene that is suitable for use herein can be any reporter gene that 
can be expressed in the desired host expression system, as described previously. For 
example, the reporter gene can be (J-galactosidase, among others. 

Similarly, the response element suitable for use herein can be any response 
5 element to which inhibition is desired. Examples of such response elements are as 
described above. The response element herein may be part of the promoter sequence 
by conventional techniques such as by synthesis of excision of a known sequence by 
restriction enzyme and linked to die promoter sequence with or without the use of 
linkers. 

10 The binding proteins for use herein may be any binding protein as described 

above. Such binding proteins may be added to the cells containing the collision 
construct for use in screening inhibitors. In doing so, the cells can be scraped off the 
culture dish or well and mixed with the added binding protein. 

Alternatively, the binding proteins can be introduced into the cell in the form of 

15 a vector containing the coding sequence of the binding protein and allowing the 
expression of the coding sequence. In this manner, a stable cell line containing the 
binding protein can be made and used for screening inhibitors. In another embodiment 
of the present invention, a cell that constinirively produces the binding protein 
constitutiveiy can be used. 

20 In a further embodiment of the present invention, a stable cell line containing 

the collision construct can be made. This can be done by introduction of the collision 
construct into a host cell, by conventional techniques such as dec tropo ration, calcium 
phosphate treatment, and lipofectarnine or transformation, and selecting a cell or cell 
line that stably expresses the collision construct 

25 A candidate inhibitor to be tested for its inhibitory activity on a target 

promoter, or on transcriptional activity can be added to a cell harboring the collision 
construct in which the target promoter is the second promoter of the construct, and 
optimally, is desired, providing the cell also with a binding protein. Expression of 
reporter gene signal is observed and compared in the absence and in the presence of the 

30 candidate inhibitor, respectively. 

Besides testing a single candidate inhibitor, stably transformed cell lines 
containing the collision construct and optionally containing a vector containing the 
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coding sequence of a binding protein can be placed in irucrotiter wells and a panel of 
inhibitors is added thereto- Reporter gene signals are also observed and compared in 
the absence and presence of the candidate inhibitors, respectively. 

In a further embodiment of the present invention, a method is provided for 
5 screening inhibitors, such as, for example, inhibitors to promoters and transcriptional 
activators. Promoters that can be used for screening inhibitors and used in the 
collision construct can be any desired promoters including, for example, promoters 
from viruses and cancer cells, bacteria and fungi. Transcriptional activators that can 
be inhibited can be any desired transcriptional activator including, for example. Tat, 

10 Rev, NFkB and Spl. The region of the promoter that can be inhibited herein can be 
any region that binds transcription factors including, for example, TAR, RRE (Rev 
response element), NFkB binding site and Spl binding site. 

An embodiment of the present invention can be tailored to screen in vivo in 
cells a random library of ribozymes for those ribozymes which act as inhibitors of 

15 transcription. Ribozymes may act by catalytically interrupting transcription by 

targeting an RNA molecule of a transcription factor that interacts with the promoter or 
by targeting the mRN A of a reporter gene. However, the use of the invention for 
screening ribozyme libraries is not limited to any theory of ribozyme function. Unlike 
inhibitors of transcription which inhibit the promoter by interfering with a promoter- 

20 transcription factor interaction, a DNA-protein interaction, ribozymes catalytically 
disable an RNA molecule. In the context of the present invention, ribozymes which 
inhibit die second promoter in the collision construct can be selected from random 
synthetically derived ribozyme libraries by enhanced reporter gene signal, indicating 
that a ribozyme is acting to disable the second promoter, or the mRN A for a 

25 transcription factor that interacts with that promoter. 

In one embodiment of the present invention, the subunits of a collision 
construct, including a first regulatory sequence, a reporter gene, optionally, a response 
element or elements, and a second regulatory sequence, can all be obtained from 
known sources using conventional techniques of restriction enzyme digestion to remove 

30 these elements from such sources. Alternatively, these subunits can be made 

syrithetically by chemical synthesis or serni-synrhetically by isolating parts thereof from 
known sources and either combining them or by combining them and making any 
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missing parts synthetically. Once obtained or made, these submits can be linked 
together, for example, by use of known linker sequences, so as to place the reporter 
gene under regulatory control of the first regulatory sequence, with the direction of 
transcription going in one direction, 5* to 3* and the second regulatory sequence under 
S regulatory control of the response element, with the direction of transcription by the 
second regulatory sequence running in a direction, 5' to 3\ but opposite that of the 
first regulatory sequence. Placement of the response element and the second 
regulatory sequence is such that activation of transcription of the second regulatory 
sequence reduces the reporter gene signal upon transcription and translation thereof, 

10 presumably as a result of collision between the two transcription units. A mechanism 
by which the second regulatory sequence generates an anti-sense message that blocks 
translation of the reporter gene cannot be ruled out. 

The spacing between the reporter gene and the response element can be varied 
to attain the desired level of inhibition of reporter gene activity. In one embodiment of 

15 the present invention, the spacing between the 3* end of the reporter gene and the + 1 
nucleotide of the promoter of the second regulatory sequence is less than 2200 
nucleotides. Preferably, this spacing is less than 1000 nucleotides; more preferably, it 
is less than 800 nucleotides. Most preferably, the spacing is between about 600 
nucleotides and about 20 nucleotides. In particular, spacings of about 21, 94, 153, 

20 406, and 556 base pairs are preferred. In an alternative embodiment, the target or 
second promoter of the collision construct can be optimally placed at a distance of up 
to 1500 base pairs from the 3* terminus of the first promoter. Thus, a reporter gene is 
selected that comprises a sequence that is shorter than or the same as this optimal 
distance. 

25 The first response element can also be linked to the second regulatory sequence 

using linker sequences or the combined first response element and second regulatory 
sequence can be removed from a known source, again by restriction enzyme digestion. 

The response element will usually be placed at the 5' terminus of the second 
regulatory region, in accordance with the nature of most promoters which would 

30 comprise the second regulatory regions of this invention. However, for example, 
when the second regulatory region is comprised of a promoter for which it is 
appropriate to place the response clement at the 3* terminus, the response element will 
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be most appropriately placed at the 3* terminus. Preferably, this response element is 
placed at its natural position in juxtaposition to the promoter being used. For example, 
when the HIV-1 LTR promoter is used, the response element, TAR, is situated 3* to 
the + 1 nucleotide of the promoter. In other primate immunodeficiency viruses and in 
5 a subset of related nonprimate lenti viruses, the response element will also be most 
appropriately positioned at the 3* terminus of the second regulatory region. See for 
example a discussion of the characteristics of die promoters of such viruses in Gullen, 
Cell (1993) 73:417-420. The response element herein can also be rrujltimerized to 
produce a more dramatic effect. An example of a response element that has been 

10 multimerized is the (tet-op]? which is an operator responsive to tetracycline induction. 

Once made, the collision construct can be introduced into an appropriate host 
cell for expressions thereof, including prokaryotic system such as bacterial, or 
eukaryotic system, such as yeast, insect cell system, or mammalian system, such as 
those described below. The binding protein may also be expressed in the expression 

15 systems described below. 

Expression in Bacterial Cells 

Control elements for use in bacteria include promoters, optionally containing 
operator sequences, and ribosome binding sites. Useful promoters include sequences 

20 derived from sugar metabolizing enzymes, such as galactose, lactose (lac) and maltose. 
Additional examples include promoter sequences derived from biosynthetic enzymes 
such as tryptophan (trp), the ^-lactamase (bid) promoter system, bacteriophage XPL, 
and T7. In addition, synthetic promoters can be used, such as the roc promoter. The 
^lactamase and lactose promoter systems are described in Chang et al., Nature (1978) 

25 275: 615, and Goeddel et al.. Nature (1979) 281: 544; the alkaline phosphatase, 

tryptophan (trp) promoter system are described in Goeddel et at., Nudeic Adds Res. 
(1980) 8: 4057 and EP 36,776 and hybrid promoters such as the toe promoter is 
described in U.S. Patent No. 4,551,433 and de Boer ctal., Proc. Natl Acad. So. 
USA (1983) 80: 21-25. However, other known bacterial promoters useful for 

30 expression of eukaryotic proteins are also suitable. A person skilled in the art would 
be able to operably ligate such promoters to the coding sequences of interest, for 
example, as described in Siebenlist etal.. Cell (1980) 20: 269, using linkers or 
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adaptors to supply any required restriction sites. Promoters for use in bacterial 
systems also generally will contain a Shine-Dalgamo (SO) sequence operabty linked to 
the DNA encoding the target polypeptide. For prokaryotic host cells that do not 
recognize and process the native target polypeptide signal sequence, the signal 
5 sequence can be substituted by a prokaryotic signal sequence selected, for example, 
from the group of the alkaline phosphatase, penicillinase, lpp, or heat stable 
enterotoxin II leaders. The origin of replication from the plasmid pBR322 is suitable 
for most Gram-negative bacteria. 

The foregoing systems are particularly compatible with Escherichia coli. 

10 However, numerous other systems for use in bacterial hosts including Gram-neganve 
or Gram-positive organisms such as Bacillus spp. , Streptococcus spp. , Streptomyces 
spp. , Pseudomonas species such as P. aeruginosa. Salmonella typhimurium. or 
Serratia marcescans, among others. Methods for introducing exogenous DNA into 
these hosts typically include the use of C2CI2 or other agents, such as divalent cations 

IS and DMSO. DNA can also be introduced into bacterial cells by electroporation, 
nuclear injection, or protoplast fusion as described generally in Sambrook et al. 
(1989), MOLECULAR CLONING: A LABORATORY MANUAL, 2d edition (Cold 
Spring Harbor Press, Cold Spring Harbor, N.Y.). These examples are illustrative 
rather man limiting. Preferably, the host cell should secrete minimal amounts of 

20 proteolytic enzymes. Alternatively, in vitro methods of cloning, e.g., PCR or other 
nucleic acid polymerase reactions, are suitable. 

Prokaryotic cells used in this invention are cultured in suitable media, as 
described generally in Sambrook et al. (1989), MOLECULAR CLONING: A 
LABORATORY MANUAL, 2d edition (Cold Spring Harbor Press, Cold Spring 

25 Harbor, N.Y.). 

Expression in yeast cells 

Expression and transformation vectors, either extrachrornosornal replicons or 
integrating vectors, have been developed for trartsforrnation into many yeasts. For 
30 example, expression vectors have been developed for, among others, the following 

yeasts: Saccharomyces cerevisiae ,as described in Hitmen et aL, Proc. NatL Acad. Set. 
USA (1978) 75:1929; Ito et al.. J. Bacterial. (1983) 755:163; Candida albicans as 
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described In Kurtz et al.. Mol. Cell. Biol. (1986) 6: 142; Candida maltosa. as 
described in Kunze et al.. J. Basic Microbiol. (1985) 25:141; Hansenula pofymorpha, 
as described in Gleeson et al.. J. Gen. Microbiol (1986) /J2.-3459 and Roggenkamp 
et at, Mol. Gen. Genes. (1986) 202:302); Ktoyveromyces fiugilis. as described in Das 
5 et al.. J. Baaeriol. (1984) 755:1165; Kluyveromyces lactis. as described in De 
Louvencouft et al.. J. Baaeriol. (1983) 754:737 and Van den Berg et al.. 
BiofTechnology (1990) 5:135; Pichia guiUerimondii. as described in Kunze et al.. J. 
Basic Microbiol. (1985) 25: 141 ; Pichia pastoris, as described in Cregg et al.. Mol. 
Cell. Biol. (1985) 5:3376 and U.S. Patent Nos. 4,837,148 and 4,929,555; 

10 Schizosaccharomyces pombe. as described in Beach and Nurse, Nature (1981) 

300.106; and Yarrowia lipotytica, as described in Davidow et al.. Curr. Genet. (1985) 
70:380 and Gaillardin et al.. Curr. Genet. (1985) 70:49, Aspergillus hosts such as A. 
nidulans, as described in Bailance et al.. Biochem. Biophys. Res. Commun. (1983) 
772:284-289; Tilbum et al.. Gene (1983) 26:205-221 and Yelton et al.. Proc Natl 

15 Acad Set. USA (1984) 57:1470-1474, and A. niger. as described in Kelly and Hynes. 
EMBOJ. (1985) 4:475479; Trichoderma reesia. as described in EP 0 244 234, and 
filamentous fungi such as, e.g. Neurospora. Penicillium, Tofypocladium. as described 
in WO 91/00357. 

Control sequences for yeast vectors are known and include promoters regions 
20 from genes such as alcohol dehydrogenase (ADH). as described in EP 0 284 044, 
enolase, glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphate- 
dehydrogenase (GAP or GAPDH), hexokinase, phosphofructokinase, 3- 
phospboglycerate mutase, and pyruvate kinase (PyK), as described in EP 0 329 203. 
The yeast PHOS gene, encoding acid phosphatase, also provides useful promoter 
25 sequences, as described in Myanohara et al. Proc. Natl Acad. Set. USA (1983) 80: 1. 
Other suitable promoter sequences for use with yeast hosts include the promoters for 3* 
phosphoglyceraie kinase, as described in Hitzeman et al., J. Biol Chan. (1980) 
255:2073, or other glycolytic enzymes, such as pyruvate decarboxylase, 
triosephosphate isomerase, and phosphoglucose isomerase, as described in Hess etal. 
30 / Adv. Enzyme Reg. (1968) 7: 149 and Holland et al. Biochemistry (1978; 77:4900. 
Inducible yeast promoters having the additional advantage of transcription controlled 
by growth conditions, include those from the fist above and others including the 
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promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, 
degradauve enzymes associated with nitrogen metabolism, metaJlothionein, 
glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and 
galactose utilization. Suitable vectors and promoters for use in yeast expression are 
5 further described in Hitzeman, EP 0 073 657. Yeast enhancers also are 

advantageously used with yeast promoters. In addition, synthetic promoters which do 
not occur in nature also function as yeast promoters. For example, upstream activating 
sequences (U AS) of one yeast promoter may be joined with the transcription activation 
region of another yeast promoter, creating a synthetic hybrid promoter. Examples of 

10 such hybrid promoters include the ADH regulator}' sequence linked to the GAP 
transcription activation region, as described in U.S. Patent Nos. 4,876,197 and 
4,880,734. Other examples of hybrid promoters include promoters which consist of 
the regulatory sequences of either the ADH2, GALA, GALI0. or PHOS genes, 
combined with the transcriptional activation region of a glycolytic enzyme gene such as 

15 GAP or PyK, as described in EP 0 164 556. Furthermore, a yeast promoter can 

include naturally occurring promoters of non-yeast origin that have the ability to bind 
yeast RNA polymerase and initiate transcription. 

Other control elements which may be included in the yeast expression vectors 
are terminators, for example, from GAPDH and from the enolase gene, as described in 

20 Holland et aL. J. Biol. Chan. (1981) 256: 1385, and leader sequences which encode 
signal sequences for secretion. DNA encoding suitable signal sequences can be 
derived from genes for secreted yeast proteins, such as the yeast irrvertase gene as 
described in EP 0 012 873 and JP 62,096,086 and the a-factor gene, as described in 
U.S. Patent Nos. 4,588,684, 4,546,083 and 4,870,008 and EP 0 324 274 and WO 

25 89/02463. Alternatively, leaders of non-yeast origin, such as an interferon leader, also 
provide for secretion in yeast, as described in EP 0 060 057. 

Methods of introducing exogenous DNA into yeast hosts are well known in the 
art, and typically include either the transformation of spheroplasts or of intact yeast 
cells treated with alkali cations. Transformations into yeast can be carried out 

30 according to the method described in Van Solingen et al., J. BacL (1977) 730:946 and 
Hsiao <r al., Proc NatL Acad. Sd. USA (1979) 76:3829. However, other methods for 
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introducing DNA into cells such as by nuclear injection, dectropo ration, or protoplast 
fusion may also be used as described generally in Sambrook et al., cited stove. 

For yeast secretion the native target polypeptide signal sequence may be 
substituted by the yeast invertase, a- factor, or acid phosphatase leaders. The origin of 
5 replication from the 2 u plasmtd origin is suitable for yeast. A suitable selection gene 
for use in yeast is the trp\ gene present in the yeast plasmtd described in Kingsman a 
al. § Gene (1979) 7: 141 or Tschemper et al., Gene (1980) 10:157. The trp\ gene 
provides a selection marker for a mutant strain of yeast lacking the ability to grow in 
tryptophan. Similarly, Leu2-derlclent yeast strains (ATOC 20,622 or 38,626) are 
10 complemented by known plasmids bearing the Leu2 gene. 

For intracellular production of the present polypeptides in yeast, a sequence 
encoding a yeast protein can be linked to a coding sequence of the desired polypeptide 
to produce a fusion protein that can be cleaved intracellularly by the yeast cells upon 
expression. An example, of such a yeast leader sequence is the yeast ubiquirin gene. 

15 

Expression in Insect Cells 

Baculovirus expression vectors (BEVs) are recombinant insect viruses in which 
the coding sequence for a foreign gene to be expressed is inserted behind a baculovirus 
promoter in place of a viral gene, e.g., polybedrin, as described in Smith and 

20 Summers, U.S. Pat No., 4,745,051. 

An expression construct herein includes a DNA vector useful as an intermediate 
for the infection or transformation of an insect cell system, the vector generally 
containing DNA coding for a baculovirus transcriptional promoter, optionally but 
preferably, followed downstream by an insect signal DNA sequence capable of 

25 directing secretion of a desired protein, and a site for insertion of the foreign gene 
encoding the foreign protein, the signal DNA sequence and the foreign gene being 
placed under the transcriptional control of a baculovirus promoter, the foreign gene 
herein being the coding sequence of die desired polypeptide. 

The promoter for use herein can be a baculovirus transcriptional promoter 

30 region derived from any of the over 500 baculovtruses generally infecting insects, such 
as, for example, the Orders Lepidoptera, Dtptera, Orthopcera, Coleoptera and 
Hymenoptera including, for example, but not limited to the viral DNAs ofAutographo 
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caUfomica MNPV, Bombyx mod NPV, rrichoplusia ni MNPV. Rachlplusia ou MNPV 
or GaUeria melloneUa MNPV, Andes aegypti, Drosophtla rnelanogaster, Spodaptera 
frugiperda, and Trichoplusia id. Thus, Che baculovirus transcriptional promoter can be, 
for example, a baculovirus immediate -early gene I EI or I EN promoter; an immediate- 
5 early gene in combination with a baculovirus delayed-early gene promoter region 
selected from the group consisting of a 39K and a Hindi!! fragment containing a 
delayed-early gene; or a baculovirus late gene promoter. The immediate-early or 
delayed-early promoters can be enhanced with transcriptional enhancer elements. 
Particularly suitable for use herein is the strong polyhedrin promoter of the 

10 baculovirus, which directs a high level of expression of a DN A insert, as described in 
Friesen et al. (1986) "The Regulation of Baculovirus Gene Expression'' in: THE 
MOLECULAR BIOLOGY OF BACULOV1RUSES (W. Doerfler, ed ); EP 0 127 839 
and EP 0 155 476; and the promoter from the gene encoding the plO protein, as 
described in Vlak et a!.. J. Gen. Virol. (1988) 69:765-776. 

15 The plasmid for use herein usually also contains the polyhedrin polyadenylation 

signal, as described in Miller et a!., Ann. Rev. Microbiol. (1988) 42:177 and a 
procaryotic anpic ill in- resistance (amp) gene and an origin of replication for selection 
and propagation in E. coli. DNA encoding suitable signal sequences can also be 
included and is generally derived from genes for secreted insect or baculovirus 

20 proteins, such as the baculovirus polyhedrin gene, as described in Carbonell et a!., 
Gene (1988) 75:409, as well as mammalian signal sequences such as those derived 
from genes encoding human a-imerferon as described in Maeda et al.. Nature (1985) 
575:592-594; human gastrin- releasing peptide, as described in Lebacq-Verheyden et 
aL, Mol Cell. Biol. (1988) 8:3129; human IL-2, as described in Smith et al.. Proc. 

25 Natl. Acad. Set. USA (1985) 82:8404; mouse 1L-3. as described in Miyajima et al., 
Gene (1987) 55:273; and human glucocerebrosidase, as described in Martin et al.. 
DNA (1988) 7:99. 

Numerous bacuioviral strains and variants and corresponding permissive insect 
host cells from hosts such as Spodaptera frugiperda (caterpillar), Aedes aegypti 
30 (mosquito), Aedes albopiaus (mosquito), DrosophUa melanogaster (fruitfty), and 

Bombyx mod host cells have been identified and can be used herein. See, for example, 
the description in Luckow et al.. Bio/Technology( 1 9S8) 6:47-55 t Miller et al. in 
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GENETIC ENGINEERING (Setlow, J.K. et al. cds ), Vol. 8 (Plenum Publishing. 
1986), pp. 277-279, and Macda et al.. Nature. (1985) 5/5:592-594. A variety of such 
viral strains are publicly available, e.g., the L-l variarit of Autographa californica 
NPV and the Bm-5 strain of Bombyx mod NPV. Such viruses may be used as the 
5 virus for transection of host cells such as Spodoptera pugiperda ceils. 

Other baculovirus genes in addition to the polyhedrin promoter may be 
employed to advantage in a baculovirus expression system. These include immediate- 
early (alpha), delayed-early (beta), late (gamma), or very late (delta), according to the 
phase of the viral infection during which they are expressed. The expression of these 

10 genes occurs sequentially, probably as the result of a "cascade" mechanism of 

transcriptional regulation. Thus, the immediate-early genes are expressed immediately 
after infection, in the absence of other viral functions, and one or more of the resulting 
gene products induces transcription of the delayed-early genes. Some delayed-early 
gene products, in mm, induce transcription of late genes, and finally, the very late 

15 genes are expressed under the control of previously expressed gene products from one 
or more of the earlier classes. One relatively well defined component of this 
regulatory cascade is IEI, a preferred immediate -early gene of Autographo californica 
nuclear poly hedro sis virus (AcMNPV). IEI is expressed in the absence of other viral 
functions and encodes a product that stimulates the transcription of several genes of the 

20 delayed-early class, including the preferred 39K gene, as described in Guarino and 
Summers, /. Virol. (1986) 5*563-571 and J, Viral. (1987) 67:2091-2099 as well as 
late genes, as described in Guarino and Summers, Virol. (1988) 762:444-451. 

Immediate-early genes as described above can be used in combination with a 
baculovirus gene promoter region of the delayed-early category. Unlike the 

25 immediate-early genes, such delayed-early genes require the presence of other viral 
genes or gene products such as those of the immediate-early genes. The combination 
of immediate-early genes can be made with any of several delayed-early gene promoter 
regions such as 39K or one of the delayed-early gene promoters found on the Hindlll 
fragment of the baculovirus genome. In the present instance, the 39K promoter region 

30 can be linked to the foreign gene to be expressed such that expression can be further 
controlled by the presence of IEI, as described in L. A. Guarino and Summers 
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(1986a), cited above; Guarino & Summers (1986b) J. VirvL. (1986) 60:215-223, and 
Guarino et al. (1986c). J. Virol (1986) 60:224-229. 

Additionally, when a combination of immediate-early genes with a delayed- 
early gene promoter region is used, enhancement of the expression of heterologous 
5 genes can be realized by the presence of an enhancer sequence in direct cis linkage 
with the delayed -early gene promoter region. Such enhancer sequences are 
characterized by their enhancement of delay ed-early gene expression in situations 
where the immediate-early gene or its product is limited. For example, the hr5 
enhancer sequence can be linked directly, in cis, to the delayed -early gene promoter 

10 region, 39K, thereby enhancing the expression of the cloned heterologous DNA as 
described in Guarino and Summers (1986a), (1986b), and Guarino et al. (1986). 

The polyhedrin gene is classified as a very late gene. Therefore, transcription 
from the polyhedrin promoter requires the previous expression of an unknown, but 
probably large number of other viral and cellular gene products. Because of this 

15 delayed expression of the polyhedrin promoter, state-of-the-art BEVs, such as the 
exemplary BEV system described by Smith and Summers in, for example, U.S. Pat. 
No. , 4,745,05 1 will express foreign genes only as a result of gene expression from the 
rest of the viral genome, and only after the viral infection is well underway. This 
represents a limitation to the use of existing BEVs. The ability of the host cell to 

20 process newly synthesized proteins decreases as the baculovirus infection progresses. 
Thus, gene expression from the polyhedrin promoter occurs at a time when the host 
cell's ability to process newly synthesized proteins is potentially diminished for certain 
proteins. As a consequence, the expression of secretory glycoproteins in BEV systems 
is complicated due to incomplete secretion of the cloned gene product, thereby trapping 

25 the cloned gene product within the cell in an incompletely processed form. 

While it has been recognized that an insect signal sequence can be used to 
express a foreign protein that can be cleaved to produce a mature protein, the present 
invention is preferably practiced with a mammalian signal sequence. 

An exemplary insect signal sequence suitable herein is the sequence encoding 

30 for a Lepidopteran adirx>kinetic hormone (AKH) peptide. The AKH family consists of 
short blocked neuropeptides that regulate energy substrate mobilization and metabolism 
in insects. In a preferred embodiment, a DNA sequence coding for a Lepidopteran 
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Manduca scxta AKH signal peptide can be used. Other insect AKH signal peptides, 
such as those from the Orthoptera Schistocerca gregaria locus can also be employed to 
advantage. Another exemplary insect signal sequence is the sequence coding for 
Drosophiia cuticle proteins such as CPI, CP2, CP3 or CP4. 
5 Currently, the most commonly used transfer vector that can be used herein for 

introducing foreign genes into AcNPV is pAc373. Many other vectors* known to 
those of skill in the art, can also be used herein. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in a kit form 
from companies such as Invitrogen (San Diego CA) ("MaxBac" kit). The techniques 
10 utilized herein are generally known to those skilled in the art and are fully described in 
Summers and Smith, A MANUAL OF METHODS FOR BACULOVIRUS VECTORS 
AND INSECT CELL CULTURE PROCEDURES. Texas Agricultural Experiment 
Station Bulletin No. 1555, Texas A & M University (1987); Smith et aL, Moi Cell. 
BioL (1983) J:2I56, and Luckow and Summers (1989). These include, for example, 
15 the use of pVL985 which alters the polyhedrin start codon from ATG to ATT, and 
which introduces a BamHl cloning site 32 base pairs downstream from the ATT, as 
described in Luckow and Summers, Virology (1989) 77:31. 

Thus, for example, for insect cell expression of the present polypeptides, the 
desired DNA sequence can be inserted into the transfer vector, using known 
20 techniques. An insect cell host can be cotransformed with the transfer vector 

containing the inserted desired DNA together with the genomic DNA of wild type 
baculovirus, usually by cotransfection. The vector and viral genome are allowed to 
recombine resulting in a recombinant virus that can be easily identified and purified. 
The packaged recombinant virus can be used to infect insect host cells to express the 
25 desired polypeptide. 

Other methods that are applicable herein are the standard methods of insect cell 
culture, cotransfection and preparation of plasmids are set forth in Summers and Smith 
(1987), cited above. This reference also pertains to the standard methods of cloning 
genes into AcMNPV transfer vectors, plasmid DNA isolation, transferring genes into 
30 the AcmMNPV genome, viral DNA purification, radiolabeling recombinant proteins 
and preparation of insect ceil culture media. The procedure for the cultivation of 
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viruses and cells are described in Volkman and Summers, /. Virol, (1975) /P:82(>832 
and Volkman. et al., J. Virol. ( 1976) /0:82O-832. 

Expression in Mammalian Ceils 
5 The polypeptides of the present invention can be expressed in mammalian cells, 

such as HeLa cells, using promoters and enhancers that are functional in those cells. 
Synthetic non-natural promoters or hybrid promoters can also be used herein. For 
example, a T7T7/T7 gene promoter can be constructed and used, in accordance with 
Chen a al. t Nucleic Acids Res. 22;1\ 14-2120 (1994), where the T7 polymerase is 

10 under the regulatory control of its own promoter and drives the transcription of the 
inserted coding sequence, which is placed under the control of another T7 promoter. 
Also suitable for use herein is the gene for the CCAAT/enhancer-binding protein 
C/EBPct, as described in Birkenmeiere/a/., Genes Dev. (1989) 5:1 146-1156. 

Typical promoters for mammalian cell expression include the S V40 early 

15 promoter, the CMV promoter, the mouse mammary tumor virus LTR promoter, the 
adenovirus major late promoter (Ad MLP), and the herpes simplex virus promoter, 
among others. Other non-viral promoters, such as a promoter derived from the murine 
metallothionein gene, will also find use in mammalian constructs. Mammalian expression 
may be either constitutive or regulated (inducible), depending on the promoter. Typically, 

20 transcription termination and polyadenylaiioa sequences will also be present, located 3' to 
the translation stop codon. Preferably a sequence for optimization of initiation of 
translation, located 5' to the polypeptide coding sequence, is also present. Examples of 
transcription tenmnator/poryadenyiauon signals include those derived from S V40, as 
described in Sambrook et al (1989), cited previously. Introns, containing splice donor 

25 and acceptor sites, may also be designed into the constructs of the present invention. 

Enhancer elements can also be used herein to increase expression levels of the 
mammalian constructs. Examples include the SV40 early gene enhancer, as described in 
Dijkema etcd. , EMBO J. (1985) 4:761 and the enhancer/promoter derived from the 
long terminal repeat (LTR) of the Rous Sarcoma Vims, as described in Gorman ctaL, 

30 Proc. Natl. Acad. Set. USA (1982b) 7*6777 and human cytomegalovirus, as 

described inBosharte/ aL. Ceil (1985) */:52L A leader sequence can also be present 
which includes a sequence encoding a signal peptide, to provide for die secretion of the 
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foreign protein in mammalian cells. Preferably, there are processing sices encoded 
between the leader fragment and the gene of interest such that the leader sequence can 
be cleaved either in vivo or in vitro. The adenovirus tripartite leader is an example of 
a leader sequence that provides for secretion of a foreign protein in mammalian cells. 

There exist expression vectors that provide for the transient expression in 
mammaiian cells of DNA encoding the target polypeptide. In general, transient 
expression involves Che use of an expression vector that is able to replicate efficiently 
in a host cell, such that the host cell accumulates many copies of the expression vector 
and, in turn, synthesizes high levels of a desired polypeptide encoded by the expression 
vector. Transient expression systems, comprising a suitable expression vector and a 
host cell, allow for the convenient positive identification of polypeptides encoded by 
cloned DNAs, as well as for the rapid screening of such polypeptides for desired 
biological or physiological properties. Thus, transient expression systems are 
particularly useful for purposes of identifying analogs and variants of the target 
polypeptide that have target polypeptide-ltke activity. 

Once complete, the mammalian expression vectors can be used to transform any 
of several mammalian cells. Methods for introduction of heterologous polynucleotides 
into mammalian cells are known in the art and include dextran-mediated transection, 
calcium phosphate precipitation, polybrene mediated transection, protoplast fusion, 
electropo ration, encapsulation of the polynucleotides) in liposomes, and direct 
microinjection of the DNA into nuclei. General aspects of mammalian cell host 
system transformations have been described by Axel in U.S. Patent No. 4,399,216. 

Mammalian cell lines available as hosts for expression are also known and 
include many immortalized cell lines available from the American Type Culture 
Collection (ATCQ, including but not limited to, Chinese hamster ovary (CHO) ceils, 
HcLa cells, baby hamster kidney (BHK) cells, monkey kidney cells (COS), human 
hepatocellular carcinoma cells (e.g.. Hep G2), human embryonic kidney cells, baby 
hamster kidney cells, mouse Sertoli cells, canine kidney cells, buffalo rat liver cells, 
human lung cells, human liver cells, mouse mammary tumor cells, as well as others. 

The mammalian host cells used to produce the target polypeptide of this 
invention may be cultured in a variety of media. Commercially available media such 
as Ham's F10 (Sigma), Minimal Essential Medium ([MEM], Sigma), RPMI 1640 
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(Sigma), and Dulbecco's Modified Eagle's Medium ([DMEMI, Sigma) arc suitable for 
cuituring the host cells. In addition, any of the media described in Ham and Wallace, 
Meth. Em. (1979) 5A44, Barnes and Sato, Anal Biochem. (1980) 702:255. U.S. 
Patent Nos. 4,767,704, 4,657,866. 4,927,762, or 4,560.655. WO 90/103430, WO 
5 87/00195, and U.S. RE 30,985, may be used as culture media for the host ceils. Any 
of these media may be supplemented as necessary with hormones and/or other growth 
factors such as insulin, transferrin, or epidermal growth factor, salts (such as sodium 
chloride, calcium, magnesium, and phosphate), buffers (such as HEPES), nucleosides 
(such as adenosine and thymidine), antibiotics (such as Gentamycin" M drug), trace 

10 elements (defined as inorganic compounds usually present at final concentrations in the 
micro molar range), and glucose or an equivalent energy source. Any other necessary 
supplements may also be included at appropriate concentrations that would be known 
to those skilled in the art. The culture conditions, such as temperature. pH, and the 
like, are those previously used with the host cell selected for expression, and will be 

15 apparent to the ordinarily skilled artisan. 

The collision construct can be introduced into host cells by conventional 
techniques including lipofectamine, DEAE-dextran, electroporation, and calcium 
phosphate, and as described above. 

For use for in screening inhibitors, a stable cell line that contains the collision 

20 construct can be made and selected. For example, the collision construct is 

electropo rated together with a selectable marker gene, for example neomycin. G418 
resistant colonies are assayed for the existence and functionality of the collision 
construct The cell line can be prokaryotic or eukaryotic in origin. Preferably, the 
cell line is eukaryotic, more preferably, mammalian. In a preferred embodiment, the 

25 cell line can be derived from HeLa cells. T-cdls, B-cells and 293 cells. 

The stable cell line containing the collision construct can also be cotrarofected 
with a plasmid containing a binding protein. The coding sequence for the binding 
protein can be inserted into an expression plasmid, such as pCG, a pEVRF derivative, 
described in Giese et al., Gates £ Development (1995) 0:995-1008. pEVRF is 

30 described in Matthias et d.. Nucleic Adds Res. (1989) 17:641$. pCG has a modified 
polylinker, and directs expression in mammalian cells from the human cytomegalovirus 
promoter/enhancer region. The coding sequence for the binding protein can also be 
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inserted into the expression plasmki pCDNA (Oontech, Palo Alio, CA). The DNA 
construct encoding the binding protein can be made by standard methods of 
recombinant DNA technology as described in Sambrook et al. (1989) MOLECULAR 
CLONING: A LABORATORY MANUAL, 2nd ed. (Cold Spring Harbor Press, Cold 
Spring Harbor, N.Y.) and Ausubel et al., cited previously. 

Alternatively, the collision construct can be iransfected into a cdl line that 
constimtively expresses a binding protein, such as, for example, HeLa cells, T-ceUs, 
B-ceils or 293 cells mat have been stably transfected with a vector that directs 
expression of the binding protein. 

Further alternatively, the host cell carrying the collision construct for use in 
screening can be exposed to a binding protein that is added into the medium containing 
the transfected cells. In a preferred embodiment, a Tat protein expression vector is 
added to a cell line carrying the collision construct. The amount of the expression 
vector can be varied depending upon the extent of inhibition of reporter gene activity 
desired. It is desirable to work in the range of about 50% to 9056 AP activity, 
preferably 60% to 80% AP activity in the absence of activation of the second 
regulatory sequence and in the range of about 10% to 50%, preferably 20% to 40% 
AP activity, in the presence of activation of the second regulatory sequence. 

Thus, in using the host cell that contains the collision construct for screening 
inhibitors, the reporter gene activity is determined in the absence of the binding 
protein, in the presence of the binding protein, and in the presence of candidate 
inhibitors being screened. A candidate that increases reporter gene activity in the 
presence of a binding protein that activates transcription, for example, can be selected 
and further tested as an inhibitor to transcriptional activation. 

In another embodiment of the present invention, kits can be made that contain 
the present collision construct for screening for inhibitors of transcriptional activation. 
Such kits can include vectors or host cells containing one or more of die present 
collision constructs in suitable containers, along with the reagents and materials 
required for the conduct of the assay or descriptions of those remaining reagents 
necessary, as well as a suitable set of assay instructions. Otter materials or reagents 
can include, for example, diluents, buffers, host cells and other reagents, appropriate 
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containers such as tubes, plates, etc.. and may be included in the kit. or described in 
the instructions. 

The present invention will now be illustrated by reference to the following 
examples which set forth particularly advantageous embodiments. However, it should 
5 be noted that these embodiments are illustrative and are not to be construed as 
restricting the invention in any way. In particular, other promoters or response 
elements and other reporter gene can be substituted for the ones described herein. 

Example I 

10 Construction of the Collision Construct with CMV and HfV-1 Promoters 

and an Alkaline Phosphatase Reporter Gene: Construct #1152 
In one embodiment of the present invention, the collision construct containing 
the human cytomegalovirus ("hCMV") promoter, the gene for the secreted form of the 
human placental heat-stable alkaline phosphatase CAP") and the promoter of the 

15 human immunodeficiency virus- 1 ("HIV-1 was generated from precursor constructs 
as described below. 

A nucleotide sequence comprising the hCMV promoter and a region derived 
from the herpes simplex thymidine kinase, <&. gene for the optimal initiation of 
translation (hereafter "the tk upstream region*), was isolated from plasmid pCG. 

20 Plasmid pCG, described in Giese et al.. Gates and Development (1995) 9:995-1008, is 
a pEVRF derivative, as described in Matthias et al.. cited previously. pCG has a 
modified poiylinker, and directs expression in mammalian cells from the human 
cytomegalovirus promoter/enhancer region. The hCMV promoter was isolated from 
pCG by digestion with restriction enzymes EcoRl and Xbal (Boehringer Mannheim, 

25 Germany). Restriction digestion for purposes herein was conducted essentially as 

described in Sambrook et al„ cited previously, and Ausubel et aL, cited previously* or 
in accordance with the manufacturer's recommendations. For example, digestions 
were typically conducted using 2 pi of 10 x restriction buffer, 0. 1 to 4 jig of DNA in 
water or TE buffer, 1-5 U of enzyme per ug of DNA and water to obtain a total 

30 volume of 20 ul The components of the digest were incubated at 37°C from about 10 
minutes to overnight, depending on the amount of DNA being digested. 
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The hCMV promoter sequence isolated from pCG was then ligated into plasmid 
pTZ19U, purchased from Pharmacia (Piscataway, N.J.), that had been cleaved with 
the same restriction enzymes, EcoRl and Xbal. The resulting plasmid was designated 
construct #1080. Ligation reactions herein were essentially performed as directed by 
5 the manufacturer of the ligase (Boehringer Mannheim, Germany) and along the 
principles described in Sambrook et al., cited previously, and Ausubd et al„ cited 
previously. Briefly, approximately 10 to 100 fempto moles (1C 15 ) of vector DNA 
were ligated with 3 to 10 fold molar excess of insert DNA in a final volume of 20 ul 
using T4 DNA ligase (Boehringer Mannheim, Germany) at 16°C from about 10 

10 minutes to overnight, depending on the amount of DNA being ligated. 

The coding region of the alkaline phosphatase gene was isolated from plasmid 
pSEAP-Basic, purchased from Clontech (Palo Alto, CA) by restriction with fft/tdill 
and Sail and ligated into plasmid Bluescript, purchased from Stratagene (La Jolla, CA) 
mat had been cleaved with the same enzymes. The resulting plasmid, designated 

15 construct #1067, was cleaved with Qa\ and Sail, the 5 '-overhangs were filled in by 
Klenow enzyme, (Boehringer Mannheim, Germany) and the ends were religated. Fill- 
in reactions described herein were conducted as directed by the manufacturer of the 
Klenow enzyme (Boehringer Mannheim, Germany). The resulting plasmid construct 
was designated #1074. This manipulation restored the Sail restriction site. 

20 The coding region of the alkaline phosphatase gene was isolated from construct 

#1074 by restriction with Xbal and Sail and ligated into construct #1080 that had been 
cleaved with the same enzymes. This ligation resulted in an intermediate recombinant 
plasmid containing an AP gene that was out-of-frame with respect to the tk region. 
For production of an in-frame fusion, the intermediate recombinant plasmid containing 

25 the AP sequence was cleaved with Xbal and HindlH, the 5'-overhangs were filled-in 
and the ends were religated. This manipulation also restored the Xbal site. The 
resulting plasmid was designated construct #1 1 12. 

The HIV-1 promoter was isolated from plasmid pHIVSCAT. as described in 
Selby and Peteriin. Cell (1990) 62:769-776, by treatment with Aspl\% and /foam 

30 The isolated fragment was ligated into plasmid Bluescript from Stratagene (La Jolla, 
CA) t that had been cleaved with the same enzymes. The resulting plasmid was 
designated construct #1075. The particular HIV-1 promoter in pHIVSCAT contains 
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several point mutations that have been introduced to create restriction enzyme 
recognition sites. Comparison of the mutant promoter in pHlVSCAT with the wild- 
type HIV- 1 promoter did not show any significant differences in activity. The 
sequence of the mutant HIV-1 promoter is listed in FIG. 5. The HIV-1 promoter was 
5 isolated from construct #1075 by restriction with £coRV and then ligated into plasmid 
pTZ18U from Pharmacia (Piscataway, NJ), that had been cleaved with Stool. The 
resulting plasmid was designated construct #1 149. 

To generate die final collision construct of this example, the HIV- 1 promoter 
was isolated from construct 01 149 by restriction with Soil and AsplXt and ligated into 

10 construct #1 112 that was cleaved with the same enzymes. The resulting plasmid was 
designated construct Ml 152, the collision construct. 

In the collision construct, the direction of transcription from the hCMV and that 
from the HIV-l promoter were in opposite directions. The distance between the end of 
the AP coding region, as defined by the stop codon TAA, and the sun of transcription 

15 in the HIV- 1 promoter, as defined by + 1 of the promoter sequence was about 213 
nucleotides. 

Example 2 

HIV Tat Protein Dependent Reduction of Alkaline Phosphatase Activity 
20 HeLa cells were transiently transfected with the following: (1) 1 ug of plasmid 

01 152, the collision construct, and (2) various amounts of plasmid pSV7fd7TAT: 0 
ug, 0. 1 ug, 0.3 ug, 0.5 ug, and 1.5 ug, respectively. Plasmid pSV7fd/TAT is herein 
referred to as the Tat expression plasmid, and its construction is described bdow. The 
amount of DNA in each transection assay was kept constant by adding Tat-tnactivc 
25 plasmid, the construction of which is also described below. Results are shown in FIG. 
1 . For transient transection of HeLa cdls described herein, lipofectarmne (purchased 
from BRL, Gaithersburg, MD) was used in accordance to the rnanufacturers* 
instructions. For transfections hereafter, except as expressly provided otherwise, 1 ug 
of the collision construct was used together with cither 0.5 ug of Tat expression 
30 plasmid (pSV7fd/TAT) and/or 0.5 ug of Tat-hiactrve plasmid. After approximately ft 
hours, the celts were washed and incubated in fresh DMB medium, supplemented with 
10% fetal calf serum. About 16-20 hours after transection, aliquots of die supernatant 
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were analyzed for alkaline phosphatase activity according to the manufacturers' 
conditions (Oontech, Palo Alto, CA). 

The Tat expression plasmid and the Tat Inactive plasmid were constructed or 
used as follows. Plasmid pS V7fd/TAT was obtained from Peterlin at University of 
5 California at San Francisco and was constructed and used as described in SeJby and 
Peterlin, Cell (1990) 62:769-776. Plasmid pSV7fd/TAT contains the coding region 
for the transcriptional activator Tat from HIV-1. In this plasmid, Tat expression is 
under control of the SV40 early promoter. Inactive Tat expression plasmid was 
generated by restriction of plasmid pSV7fd/TAT with Xbal. The DNA ends were 

10 ' filled-in and the plasmid reiigated. this procedure generated a frame shift mutant that 
resulted in a premature stop codon and no functional Tat protein expression. The 
resulting construct is referred to as Tar-inactive plasmid, and was used herein to keep 
the total amount of DNA in each transection constant. 

FIG. 1 shows a reduction of AP activity mat was dependent on the amount of 

IS Tat expression plasmid added. Over the range of 0 to 2 ug of Tat expression plasmid, 
reporter gene activity decreased from about 100% to about 25%, resulting in about 
75% inhibition at the highest level tested. Reporter activity was about 40% when Tat 
was present at a level of between about 0.3 to 0.5 ug of Tat expression vector. At this 
level, mere is almost no nonspecific effect of Tat protein on CM V promoter activity. 

20 

Example 3 

N Study of the Dependence of TAR on Tat for Reduction in AP Activity Using Deletion 
Constructs Derived from Construct #11 52 

To examine the dependence of the presence of the TAR sequence in the HIV-1 
25 promoter for the observed Tat-dcpendent reduction in AP activity in the previous 

example, the following deletions were introduced into construct #1 152: (1) deletion of 
a major portion of the TAR sequence, construct #1161; (2) deletion of the entire TAR 
sequence, construct #1225; (3) deletion of the entire TAR sequence and the TATA 
box. construct #1 162; and (4) deletion of the entire TAR sequence, the TATA box, 
30 and the three Spl binding sites, construct #1163. 
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Construct #1 152 was restricted with BgM and BamHl and the ends reiigated. 
The resulung plasmid represents construct #1161. This deletion removes about 36 
nucleotides of the TAR sequence. 

Complete removal of die TAR sequence was obtained by deleting nucleotides 
5 from position -14 to +59, by PCR using DNA of construct #1152 as template and 
primer #613 (5 '-GCXS AAGCr TTGCAGCTGCTTAT ATGCAGCA -3 ') and reverse 
primer, purchased from New England Biolabs, Beverly, MA. The underlined 
sequence represents nucleotides -35 to -13 of the HIV-1 promoter beginning with 
Hindlll restriction site; the norKinderiined portion includes the Hindlll restriction site 
10 that begins after the nucleotides GCG at the 5'-cnd. The resulung DNA fragment was 
digested with Hindlll and Asp718 and reiigated into construct #1 152 cleaved with the 
same enzymes. This manipulation also removed the start of transcription and 
generated construct #1225. 

A third deletion construct was prepared from construct #1 152 by restriction 
15 with Xbal and religation. The resulting plasmid is designated construct #1 162. This 
deletion removes the complete TAR sequence and the region containing the TATA box 
sequence. 

A fourth deletion construct was prepared from construct #1 152 by restriction 
with Sma\ and BamHl and the ends reiigated. The resulting plasmid is designated 

20 construct #1163. This deletion removes the complete TAR sequence, the TATA box 
sequence and the three Spl binding-site sequences. 

FIG. 2 shows the results of measuring the specific reduction of alkaline 
phosphatase expression in the presence of 0.5 ug HIV-1 Tat protein expression plasmid 
in the deletion constructs #1 161 , #1225, #1 162 and #1 163, as compared with construct 

25 #1 152. For construct #1 152, in the absence of Tat, activity of the AP gene was set to 
100%. In the presence of Tat, activity of the A? gene was reduced to 36% ± 7%. 
Thus, in the presence of Tat, expression of AP was reduced by about 64%. For 
construct #1161, in which most the TAR sequence was deleted, expression of the AP 
gene in the absence of Tat was about 55%. Addition of Tat only reduced AP gene 

30 expression to about 53%, indicating that inhibition of reporter gene activity by Tat 
protein is TAR sequence dependent Construct #1225, comaining a deletion in the 
TAR sequence, produced about 150% AP gene activity in the absence of Tat, and 
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about 140% AP gene activity in the presence of Tat. Construct #1 162, containing a 
deletion in the TAR sequence and in the TATA box, produced about 150% AP gene 
expression in the absence of Tat protein, and about 153% AP gene expression in the 
presence of Tat Construct #1 163, containing deletions in the TAR sequence, the 
TATA sequence and the Spl binding sites, generated a still higher level of AP gene 
expression of about 160% in the absence of Tat, and about 150% in the presence of 
Tat. These results demonstrate the requirement for a functional TAR sequence and a 
TATA box sequence for activation of the HfV-1 promoter. In addition, significant 
up regulation of HIV-l promoter activity by Tat protein requires a functional TAR 
sequence. 

Example 4 

Effect of Various Deletions of the HIV-l Promoter on Transcriptional Activation and 
Construction of TAT-lnactive Plasmid 
Other constructs were made to test the effect of deletion of portions of a 
promoter region on transcriptional activation, using the AP gene as the reporter gene. 
Construct #1085 was made using the mutant HIV-l promoter as described in Example 
3, as an Asp718fHindIU DNA fragment isolated from pHIVSTAT and ligated into 
plasmid pSEAP-Basic (Clontech, Palo Alto, CA) that was cleaved with the same 
enzymes. This operation linked the HIV-l promoter to the AP gene. Constructs 
#1 166, #1213, #1 167 and #1168 were made by isolating the HIV- 1 promoter deletion 
constructs from construct #1161, #1215, #1162 and #1163, described above, and 
ligating them as Asp7WHmdIlI fragments into plasmid pSEAP-Basic. Thus, construct 
#1085 contains the entire HIV-l promoter. Construct #1166 lacks about 36 
nucleotides of the TAR sequence. Construct #1213 lacks all of the TAR sequence and 
nucleotides comprising the original start of transcription. Construct #1 167 lacks the 
TAR sequence and the TATA box. Construct #1 168 lacks the TAR sequence, the 
TATA box, and the three Spl binding sites. HeLa cells were transiently transfected 
with the latter constructs as described before and AP gene expression was observed as 
described before. 

FIG. 3 shows that in the absence of Tat, the full-length HIV-l promoter in 
construct #1085, containing the TAR sequence, was unable to induce significant 
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expression of the AP gene, resulting in about 2% AP gene activity. In the presence of 
0.5 jig protein expression plasmid Tat. construct #1085 activated AP gene expression. 
The AP gene activity produced by construct #1085 was set to 100%. Thus, in the 
presence of Tat, there was approximately, a 50-fold activation. Deletion construct 
5 #1 166 that lacked most of the TAR sequence showed only about 2% AP activity in the 
presence of Tat protein. However, in the absence of Tat protein, the truncated HIV-1 
promoter produced about 12% AP activity. Deletion constructs #1213, #1 162 and 
#1 163 showed no basal and also no Tat-mducible promoter activity. 

10 Example 5 

Constructs to Study the Effect of Spacing on the Function of the Collision Construct 
Constructs having varying distances or spacer regions between the first and 
second promoters or between the second promoter and reporter gene were made to 
study the effect of spacing on the function of the collision construct. 

15 The various collision constructs made contained spacer regions of 21 

nucleotides (construct # 1 190), 94 nucleotides (construct #1 181), 153 nucleotides 
(construct #1 187), 406 nucleotides (construct #1 188), 556 nucleotides (construct 
#1 189) and 2047 nucleotides (construct #1 159), positioned between the 3* end of the 
AP coding sequence, as defined by the stop codon, and the end of the TAR sequence at 

20 +59 nucleotide in the HIV-l promoter, with + 1 nucleotide as the start of 

transcription. Those constructs were made as follows: Construct #1 190 was made by 
restriction digest of construct #1 152 with Hpal and Hindlll and religatkm. This 
manipulation changed the stop codon from TAA to TGA. Construct #1181 was made 
by restriction digest of construct #1 152 with Sail and Hindlll and retigation. 

25 Constructs #1187, #1188 and #1189, respectively, were made by insertion of parts of 
the PEBP2a coding region, as described in Ogawa a ai. (1993) Proc, Nad. Acad. Sa. 
USA 90:6859^6863, as a Sacl/Hindlll DNA fragment, an Asp7lS/Hindlll DNA 
fragment or a Ncol/HindJH DNA fragment, respectively, into construct #1 152 cleaved 
with Sail which was blunt-ended and Hindlll. Construct #1159 was made by insertion 

30 of the htciferase gene isolated from plasmid pT3/T7-Luc (Oontech, Palo Alto, CA) as 
a Sail/Asp718 fragment into construct #1 152 cleaved with the same enzymes. Results 
are shown in FIG. 4. 
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FIG. 4 illustrates Chat collision, inhibition, or AP expression is dependent on 
the spacing between the HIV-1LTR and the reporter gene, or alternatively, on the 
spacing between the first promoter and the second promoter. In the presence of 0.5 ug 
of Tat expression plasmid, AP activity in constructs with spacing from about 21 
nucleotides to about 556 nucleotides is between 40% and 46%, and thereafter, as the 
spacing increased to about 2047 nucleotides, the alkaline phosphatase activity 
increased, indicating a certain space requirement for collision. Transient transections 
were done using 1 ug of collision construct DNA and 0.5 ug of either active or 
inacdve Tat expression plasmid. These results illustrate that collision is dependent on 
the spacing between the HIV-1 LTR and the first regulatory sequence that included the 
reporter gene. 



Example 6 

HeLa Cells Stably-Transfected with Tat Protein Expression Plasmid 
HeLa cells were transfected with 10 ug of either active Tat expression plasmid 
or with Tat-inactive plasmid together with 1 ug of plasmid pSVNeo (purchased from 
Clontech, Palo Alto, CA ) for selection by electroporation using a BioRad Gene Pulser 
(Purchased from BioRad. Hercules, CA). Electroporation was conducted in 
accordance with the manufacturer's instructions. For example, the conditions for 
electroporation are 1000 uF and 300 volts at room temperature in a final volume of 
500 ui medium containing 10% fetal calf serum, and 50 ug/ml each of penicillin and 
streptomycin. Stable Tat expression colonies were identified by resistance to 400 
ug/ml gentamicin (purchased from GIBCO BRL) after about 10 to 14 days. Stable 
colonies were picked, amplified, and analyzed for Tat protein expression by 
uansfection with construct #1085. the plasmid with an HIV-1 promoter linked to the 
AP reporter gene. Positive Tat-expressing cell lines were identified by measuring 
alkaline phosphatase activity, according to directions described by the manufacturer of 
pSVNeo (Clontech, Palo Alio. CA). 



Example 7 

Assay to Screen for Inhibitors of Transcriptional Activation 
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The following assay is designed to screen for inhibitors of the Tat protein, the 
Tat/TAR interaction or any other HIV- 1 promoter target The inhibition sought is 
identified by significant inhibition of long terminal repeat (LTR) activity. Cell lines 
are stably transfected as described in Example 6 above with the collision construct 
5 #1152 and the Tat plastnid. Those stabry transfected cell lines that produce a steady- 
state alkaline phosphatase activity of about ~ 30% to 40% compared to control HeLa 
cells stably transfected with only the collision construct are selected for screening. 
Screening for inhibitors is conducted as follows: inhibitors are introduced into the 
culture medium and, after about 16 to 20 hours, the supernatant is analyzed for 

10 alkaline phosphatase activity, as described in Example 6. Using a 96-wd! assay plate, 
for example, 12 different inhibitors, at 8 different concefurations, are tested 
simultaneously. Those inhibitors that produce an increase in alkaline phosphatase 
activity are further characterized by transient transfection experiments. Transient 
transfections are conducted as described in Example 2. For example, HeLa cells are 

15 transiently transfected with construct #1 152 and a controlled amount of Tat plasmtd in 
the presence of an inhibitor. The inhibitor is then tested for dose responsiveness to Tat 
by titrating the inhibitor or the Tat expression plasmid. Separate transient transfection 
assays are repeated for each inhibitor selected by the screening process described 
above. 

20 

i 

Example 7 
Constructs Including TAR Decoys 
To prove the functionality of the collision construct in identifying inhibitors of 
25 HIV-l transcription by an increase in AP reporter gene activity, we constructed vectors 
containing mulumerized TAR sequences (a schematic of which is indicated in FIG 6). 
Overexpression of TAR-containing sequences, referred to as TAR decoys have proved 
sufficient at inhibiting HIV-l promoter activity by squelching Tat-mediated 
transactivauon, as described in Sullenger ct at Cell 63: 601-608 (1990) and Graham 
30 andMaio, Proc. NatH Acad Set USA 87: 5817-5821 (1991). The ability of Tar decoys 
to block HIV-l transcription was previously analyzed with an HIV-AP construct in 
which the reporter gene was placed under the regulatory control of the HIV-l 
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promoter, as schematically represented in FIG 6a. Transfcction of the HIV-AP 
plasmid together with a Tat expression vector into HeLa cdls showed approximately a 
45-fold stimulation of reporter gene expression relative to the level detected in the 
absence of Tat. Addition of the TAR decoys almost completely abolished the 
5 transactivation function of Tat. Next, the ability of the TAR decoys to inhibit HIV-1 
transcription in the context of the collision construct, as represented schematically in 
FIG. 6b. The results showed that the addition of TAR decoys blocked the potential of 
Tat to repress reporter gene expression by counter-transcription. At high TAR decoy 
concentrations, AP activity was even higher than the level detected in the absence of 

10 Tat, as shown in lanes 4 and 5 of FIG. 6b. These data suggest taht the TAR decoys 
sequester not only Tat but also other cellular factors that bind to the TAR sequence. 
This result is further supported by the result schematically depicted in a portion of FIG 
1 that indicated that AP values detected with a ATAR construct are similar to the 
values obtained with the wild type collision construct in the presence of the TAR 

15 decoys. 

The plasmid pBJ-TARS was constructed by insertion of multimerized TAR 
sequences isolated from pHIVSCAT with Xbal and Hindlll restriction enzymes 
(nucleotides -40 to +59) downstream of the Sra promoter in pBJ. as described in 
Takebe et al Mai Cell. Biol. 8: 466-472 (1988). As a control plasmid. the ieptin 
20 gene, as described in Giese et al Embo J. 12:4667-4676 (1993) with a similar size 
compared to the TAR sequences was inserted into the pBJ vector. 
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WHAT IS CLAIMED IS : 

1 . A collision construct comprising a nucleic acid molecule that comprises: 



promoter; 

wherein the first promoter is different from the second promoter, and 
10 direction of transcription under the first promoter is opposite to direction of 
transcription under the second regulatory sequence, and 

wherein regulation of the second promoter alters reporter gene signal. 

2. The collision construct of claim 1, wherein the second promoter 
15 comprises a first response element that is capable of specifically binding to a first 
binding protein to form a first binding pair; and formation of the first binding pair 
regulates the second promoter under transcription-regulating conditions. 



3. The collision construct of claim I , wherein the regulation of the second 



20 promoter is achieved by activation thereof. 

4. The collision construct of claim 1 , wherein the second promoter 
comprises a 5* terminus and a 3* terminus and the reporter gene is separated from 3* 
terminus of the second promoter by a distance of less than about 2047 nucleotides. 



5. The collision construct of claim 4, wherein the distance is in a range 
selected from the group of ranges, in nucleotides, of about 1-50. 51-100, 101-150, 
151-200, 201-300, 301-400, 401-500, 501-600, 601-1000, 1001-1500. 1501-2200. 



5 



a) 
b) 



a first regulatory sequence that comprises a first promoter; 
a reporter gene that is under transcriptional control of the 
first promoter; and 

a second regulatory sequence that comprises a second 



c) 



30 



6. The collision construct of claim 4, wherein the distance is in a range 
selected from the group of ranges, in nucleotides, of about 1-20, 21-40, 41-60, 61 -SO, 
81-100, 101-120, 121-140, 141-160, 161-250, 251-425, 426-550. 
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7. The collision construct of claim 4, wherein the distance, in nucleotides, 
is selected from the group consisting of about 21. 94, 153, 406, 556 and 2047. 

5 8. The collision construct of claim 1 , wherein the first promoter is a 

minimal promoter. 



9. The collision construct of claim 1 , wherein one or both of the first and 
second promoters are each selected from the group consisting of promoters derived 
10 from a virus, a bacteriophage, a prokaryodc gene, and an eukaryouc gene. 



10. The collision construct of claim 9, wherein the virus is selected from the 
group consisting of a retrovirus, a vaccinia virus, a herpes virus, a hepatitis virus, a 
papilloma virus, an adenovirus, and an adeno- associated virus. 

15 

1 1 . The collision construct of claim 1 , wherein the first promoter is selected 
from the group consisting of a Xpl promoter, a Xnt promoter, a prokaryotic ribosomal 
RNA P1/P2 promoter, a Rous Sarcoma Virus promoter, a Simian Virus 40 promoter, 
a simian irruriunodeficiency virus promoter, an albumin promoter, a Ick promoter, and 

20 a fos promoter. 



12. The collision construct of claim 9, wherein second promoter is a 
promoter derived from a virus and the virus is selected from the group consisting of a 
cytomegalovirus, a herpes simplex virus, a hepatitis virus, and a human 

25 irnmunodeficiency virus. 

13. The collision construct of claim 9, wherein the second promoter is 
selected from the group consisting of a fit promoter, a CD4 promoter, and a £-3 
promoter. 

30 

14. The collision construct of claim 1, wherein one or both of the first 
regulatory sequence and second regulatory sequence comprise a synthetic sequence. 
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15. The collision construct of claim 14, wherein the first regulatory 
sequence comprises a synthetic sequence and the synthetic sequence is selected from 
the group consisting of a multimeric Gal4 binding site linked to a minimal promoter 
and a LexA binding site linked to a minimal promoter. 

16. The collision construct of claim 14, wherein the synthetic sequence 
comprises a TATA box. 

17. The collision construct of claim 1, wherein the first promoter comprises 
a second response element mat is capable of specifically binding to a second binding 
protein to form a second binding pair, wherein formation of the second binding pair 
regulates the first promoter under transcription-regu lating conditions, and the second 
binding protein is incapable of specifically binding to the first promoter. 

18. The collision construct of claim 1, wherein the reporter gene is selected 
from the group consisting of genes encoding alkaline phosphatase, luciferase, 
chloramphenicol acetyl transferase, p-galactosidase, (^-glucuronidase, and green 
fluorescent protein. 

19. The collision construct of claim 2, wherein the first response element is 
derived from a promoter or promoter/enhancer region of a gene selected from the 
group consisting of a viral gene, a bacteriophage gene, a prokaryotic gene, and an 
eukaryotic gene. 

20. The collision construct of claim 17, wherein the second response 
element is derived from a promoter or promoter/enhancer region of a gene selected 
from the group consisting of a viral gene, a bacteriophage gene, a prokaryonc gene 
and an eukaryotic gene. 



WO 97/10360 



-43- 



PCT/US96/1JS45 



21. The collision construct of claim 2, wherein the first response element is 
one selected from the group consisting of a transactivation response element ("TAR"), 
Rev response element ("RRE"), a NFhcB binding site, and a Spl binding site. 

5 22. The collision construct of claim 17, wherein the second response 

element is one selected from the group consisting of a transacdvation response element 
("TAR"), Rev response element ("RRE"), a NF-kB binding site, and a Spl binding 

site. 

10 23. The collision construct of claim 2, wherein the first binding protein is 

one selected from the group consisting of Tat, Rev, NF-kB, and Spl. 

24. The collision construct of claim 1 , wherein the first promoter has a 
strength of transcription that is approximately the same as that of the second promoter 

15 upon activation. 

25. A vector comprising the collision construct of claim 1, further 
comprising a nucleotide sequence that allows for expression of the collision construct 
in a host cell. 

20 

26. A host cell comprising the vector of claim 25. 

27. The host cell of claim 26, wherein the host cell is capable of amplify ing 
the collision construct or effecting the expression thereof. 

25 

28. The host cell of claim 26, wherein the cell is selected from the group 
consisting of a prokaryotic cell and an eukaryotic cell. 



29. The host cell of claim 27, wherein the cell is capable of amplifying the 
30 collision construct and is a prokaryotic cell. 
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30. The host cell of claim 27, wherein the cell is capable of effecting the 
expression of the collision construct and is an eukaryotic cell. 

31. A kit comprising the collision construct of claim 1, further comprising 
5 instructions for use thereof. 



32. A kit comprising the vector of claim 25. further comprising instructions 
for use thereof. 

10 33. A kit comprising the host cell of claim 26, further comprising 

instructions for use thereof. 



34. A method for screening a candidate inhibitor for its ability to inhibit 
transcription under a target promoter comprising: 

15 a) providing a ceil that comprises the collision construct of 

claim 1 , wherein the second promoter is the target promoter; 

b) determining reporter gene signal in the presence and absence 
of the candidate inhibitor; and 

c) comparing reporter gene signals obtained to determine 
20 whether inhibition of transcription under the second 

promoter occurred in the presence of the candidate inhibitor. 

35. The method of claim 34, wherein the target promoter is not endogenous 
to the cell. 

25 

36. The method of claim 34, wherein the target promoter is endogenous to 
the cell. 



37. A method for screening a candidate inhibitor for its ability to inhibit 
30 binding between a target binding protein and a target response element comprising: 
a) providing a cdl that comprises the collision construct of 

claim 2, wherein the first binding protein is the target 



WO 97/10360 



-45- 



PCT/US96/13845 



binding protein and the first response element is the target 
response element; 
b) providing a baseline reporter gene signal in the absence or 

presence of the target binding protein; 
5 c) deterrnining reporter gene signal in the presence and absence 

of the candidate inhibitor; and 
d) comparing reporter gene signals obtained. 



38. The method of claim 37, wherein the target response element is not 
10 endogenous to the cell. 



39. The method of claim 37, wherein the target response element is 
endogenous to the cell. 

IS 40. The method of claim 37, wherein the target binding protein is provided 

by a process selected from the group consisting of: 

a) introducing into the cell a nucleotide sequence mat encodes 
die target binding protein; 

b) allowing a cell that is capable of producing the target 

20 binding protein constitutivdy to produce the target binding 

protein; and 

c) adding the target binding protein to the cell. 



41 . A method for identification of an inhibitor of transcription under a target 
25 promoter comprising: 

a) providing a cell that comprises the collision construct of 
claim 1, wherein the second promoter is the target promoter; 

b) deterrnining reporter gene signal in the presence and absence 
of a panel of candidate inhibitors; 

30 c) comparing reporter gene signals obtained to determine if 

inhibition of target promoter activity has occurred in the use 
of any one of the panel of candidate inhibitors. 
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42. A method for making a reporter collision construct, comprising: 

a) providing a first regulatory sequence that comprises a first 
promoter, a reporter gene that is capable of providing a 

5 detectable signal upon transcription and translation, and a 

second regulatory sequence that comprises a second 
promoter; and 

b) linking the first regulatory sequence, the reporter gene, and 
the second regulatory sequence together to produce the 

10 collision construct of claim 1 . 

43. A method for production of a collision construct, comprising culturing 
the host cell of claim 33. 

15 44. A collision construct produced by a process comprising expressing the 

collision construct of claim 1 in a prokaryotic or eukaryouc cell. 



20 



45. The collision construct of claim 44, wherein the eukaryouc cell is 
selected from the group consisting of a mammalian cell, an insect cell, a yeast cell, and 
an avian cell. 
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GACGCTGACCGCGGGGGC6GCCGT6GTGGCTGCGGCGCGT6GGCCCAATT6GGCACCA6G 

9 KAS1 NARI # 18 NAEI. 41 SMAI XHAI, 46 HPAI , 

61 CC6CGTTGCTTCCTCTGCTGGCCGGGACATCAGGTGGCCCCCGCTGAATTGGAATCGTCG 
GGCGCAACGAAGGAGACGACCGGCCCTGTAGTCCACCGGGGGCGACTTAACCTTAGCAGC 

117 SALI . 

121 ACTCTAGAGGATCCCCATCAA6CTTGCAT6CCTGCAGGTC6ACTCTAGAGGATCCCCATC 
TGAGATCTCCTA6GGGTA6TTCGAACGTACG6ACGTCCAGCTGAGATCTCCTA6GG6TAG 

123 XBAI. 129 BAHH1. 130 BSAB1. 140 HIND3. 116 SPHI. 151 SSE 
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181 AAGCTTTATT6AG6CTTAA6CAGTG66TTCCCTA6TTAGCCAGAGAGCTCCCAG6CTCA6 
TTCGAAATAACTCCGAATTCGTCACCCAAGGGATCAATCGGTCTCTC6AGGGTCCGAGTC 

a. a a 

181 HIND 3-195 AFL2, 225 SACI, 239 BGLD, 

24 1 ATCTGGTCTAACCAGAGAGACCCAGTGCAT6CAAAAAGCAGCTGCTTATATGCAGCATCT 
TAGACCAGAnGGTCTCTCT6GGTCAC6TACGTTTTTCGTCGACGAATATACGTCGTAGA 
^♦1 U CC 

267 SPHI, 279 PYU2, 298 XBAI, 

301 AGAGGGCAC6CCAaCCCCAGTCCC6CCCAG6aAC6CCTCCCGGGAAAGTCCCC 
TCTCCCGTGCGGT6AGGGGTCAGGGCGGGTCC6GTGCGGAGG6CCCTTTCA6GGG 

Insertion * iA 

341 SMAI XHAI* 

361 AAAGTCCCTTGGAGAAAGCTC6AT6TCAGCA6TCTTT6TA6TACTCC6GAT6CA6CTCTC 
mCAGG6AACacmCGA6aAa6TCGTa6AAACATCATGAGGCCTACGTCGA6AG 

A A 

400 SCAi. 405 BSPE1, 

421 GGGCCAT6TGAT6AAAT6CTAGTTT6CTGTCAAACCTC(^CACTAACACnCTTTCTCCG 
CCWGTACACTACTnACGATCAAACGAWGlTTGGAGGTGTGATTGTGAAGAAAGAGGC 

481 CGTCCTCCATCCaTGaGGCTCATAGGGTGTAACAAGCTGn6nCTCTCCnCAn66 
GCAGGAGGTAGGGTACGTCCGAGTATCCCACATT6TTCGACAACAAGAGAGGAAGTAACC 

541 aTCnCTACCnCTCTG6aaACTG6TAqAG(TrGAAGaCCATCCAAA6GTCA6TG 
GGA6AAGATG6AAGAGACCGAGn6ACUTGATC6AACrTCGTG6TA6GnTCCAGTCAC 

601 GATGGGTACCGA6CTC6AATTCCCTATAGTGAGTC6TATTAAATTCGTAATCA 
CTACCCAT6GCTCGA6CnAA6GGATATCAaCA6CATAAnTAA6CAnA6T 
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